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1.  Introduction 


Molecular  inverse  design  beyond  the  purview  of  drug  design  has  of  late  enjoyed  increasing 
popularity.  Examples  can  be  found  in  protein  design  (1  4)  or  high- hyperpolarizability 
materials  (5-7).  The  design  problem  is  complicated  by  the  vastness  of  possible  chemicals, 
termed  chemical  space.  This  space  can  be  viewed  as  combinatorially  complex,  e.g.,  there 
are  208  ~  2.6  •  1010  octa-peptides  of  the  naturally  occurring  amino  acids  alone.  As  a 
consequence,  a  variety  of  methods  have  been  developed  for  discrete  optimization  in 
chemical  subspaces  (8-14)-  All  these  methods  restrict  the  search  to  a  predetermined 
subspace,  often  focused  around  molecular  scaffolds.  While  this  is  convenient  for  reduced 
computational  cost,  it  represents  a  bias,  which  precludes  the  discovery  of  unsuspected  and 
truly  novel  solutions  to  the  design  problem,  but  also  requires  a  priori  knowledge  to  be 
successful.  Instead  of  restricting  the  space  explicitly,  a  “functional”  description  of  the 
search  space  could  remove  the  bias.  Such  a  functional  description  must  encode  the  target 
properties  and  constraints  of  the  problem  in  a  fashion  that  is  amenable  to  computation. 

The  continuous  optimization  of  chemicals  used  in  the 
linear-combination-of-atomic-potentials  (LCAP)  method  (15,  16)  and  the 
variation-of-particles  density-functional-theoretical  (VP-DFT)  method  introduces  an 
important  concept  to  overcoming  the  bias  efficiently  (17-19).  LCAP  and  VP-DFT 
interpolate  continuously  between  the  Hamiltonians  of  various  chemical  species. 
Furthermore,  recently  an  investigation  into  the  reasons  why  chemical  optimization  is, 
relatively  speaking,  “easy”  used  probability  distributions  and  expectations  on  the  control 
variables  to  arrive  at  its  conclusion  (20,  21).  It  is  this  general  idea  that  is  extended  and 
developed  in  the  following,  employing  distributions  of  nuclear  charges  and  associated 
potentials.  Since  the  analytical  form  of  such  functionals  is  not  generally  known,  we  derive 
approximate  functionals  for  the  optimization  of  an  electronic  property  for  a  molecule  in  the 
ground  state  (which  is  applied  as  a  constraint),  outlining  a  procedure  for  the  development 
of  other  functionals. 


2.  Distributions  as  Optimization  Variables 


Our  goal  is  to  develop  an  optimization  algorithm  of  chemical  species  based  on  non-negative 
distributions  only.  In  order  to  be  useful,  it  is  required  that  the  result  can  be  identified 
(preferably  uniquely)  with  a  specific  chemical.  As  a  consequence,  it  is  necessary  to  describe 
the  properties  of  interest,  the  targets,  and  the  constraints,  as  functionals  in  terms  of  these 
distributions.  This  is  accomplished  via  probability  distributions  on  chemical  space. 
Furthermore,  for  properties  with  proper  optima,  the  existence  of  and  convergence  on  global 
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optima  are  proven  for  the  exact  functionals.  We  begin  by  defining  rigorously  the  terms 
chemical  space,  conformational  space,  and  configurational  space  as  used  later. 

Definition  1.  The  compositional  space  C,  i.e.,  the  space  of  chemical  compounds,  is 
identified  with  the  set  of  descendingly  ordered  finite  sequences  in  N.  Each  non-zero  entry  of 
the  sequence  denotes  the  charge  of  an  atomic  nucleus. 

For  example,  lithium  hydride  (LiH)  corresponds  to  (3, 1)  and  beryllium  hydride  (Behh)  to 
(4, 1, 1).  Due  to  the  common  restriction  of  charge,  the  charge  is  considered  a  separate 
property  derivable  from  the  composition  and  problem  setting  rather  than  an  inherent 
component  of  the  chemical  definition,  although  it  is  possible  to  augment  this  picture 
without  altering  the  general  applicability  of  the  following  results.  This  choice  of  identifying 
the  composition  of  chemicals  reflects  common  usage  in  chemistry  as  well  as  providing  a 
unique  mathematical  object  for  each  such  composition. 

Definition  2.  The  configurational  space  T  is  the  set  of  finite  sequences  ((qi,  Ri))iei  in 
N  x  M3  of  strictly  desceiiding  lexicographical  order  (i  <  j  if  qi  >  qj  or  qi  =  qj,  Rf  =  Rf  for 
0  <  v  <  /i  <  3  and  Rf  <  R'i-).  The  coordinate  system  is  chosen  to  conform  to  the  rules: 

1.  The  coordinate  system  is  right-handed  and  comprised  of  principal  axes  through  the 
center  of  mass. 

2.  If  the  coordinate  system  is  ambiguous,  then  the  axes  are  chosen  to  be  a  right-handed 
system  such  that  the  resultant  sequence  is  minimal  with  respect  to  lexicographical 
order  between  sequences. 

The  second  entry  in  each  couplet  is  the  position  of  the  respective  atom  in  real  space. 
Ambiguous  coordinate  systems  may  arise  from  high  symmetry  groups,  which  necessitates 
the  second  rule  in  defining  the  coordinate  system.  Now,  let  7  :  T  — >  C  map  a  configuration 
to  its  respective  composition,  i.e.,  ((qi,  Ri))iei  (qfjiei- 

Definition  3.  The  conformational  space  Dc  of  a  composition  c  is  the  pre-image 
7 _1(c)  =  Tlc  C  (R3)^c\  where  1(c)  is  the  length  of  the  sequence  c. 

For  example,  LiH  corresponds  to 

H uh  —  {(a,  y,  z)  x  (u,  v,  w)  :  x  >  u  V  (x  —  u  A  y  >  v)  V  (x  —  u  A  y  —  v  A  z  >  -in)}  C  M3  x  M3. 

Each  of  the  preceding  definitions  refines  the  general  notions  of  chemistry  rigorously,  which 
allows  for  well-defined  treatment  and  analysis  of  chemical  properties.  We  introduce  the 
following  terminology  to  simplify  expressions. 

•  Let  C  :  N  — »•  C  be  an  enumeration  of  compositions. 
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•  The  nuclear  charge  of  C(i)  is  ||C'(i)||i  =  e.g.,  for  LiH,  the  charge  is  4. 

•  Let  Pi  :  7 -1  ((?(*))  =  Ctcn)  ->Kbea  probability  distribution  on  the  conformational 
space  of  composition  C(i),  i.e.,  Pt(x)  >  0\/x  G  hlcp)  and  f  Pid/i(Qc(i ))  =  1  with 
measure  /j  on  h2c(i)- 

•  Let  p' :  C  ->  R  be  a  probability  distribution  on  compositional  space,  which  is 
equivalent  to  p  :  N  — >  M,p(i)  =  p'((7(i)),  i.e.,  p(i)  >  OVi  G  N  and  ^AeNp(i)  =  1. 

For  a  probability  distribution  p  :  T  — >  M,  p  and  P*  can  be  defined  such  that  the  probability 
finding  composition  C(i )  at  conformation  P*  G  ^c(i)  is  p(i)Pi(Ri),  via 
p{i)  =  Xy-qcp))  p{x)dp{Y)  and  P*  :  7'1(C'(i))  -»•  R,x  ^  p{x)/p{i). 

With  the  help  of  these  probability  distributions,  the  following  lemma  now  sets  the  stage  for 
optimizations  in  chemical  space. 

Lemma  1.  For  a  bounded  property  O  :  V  — >  M  that  attains  its  global  optimum  exactly  once, 
the  optimal  nuclear  charge  distribution  is  a  distribution  of  discrete  nuclei  and  charges. 

Proof.  Let  Omin  :=  0(Xmin).  Then  the  expectation  value  for  O  is  a  linear  functional  of 
probability  distributions,  and  (O)  =  fr  p(x)0(x)dx  has  a  minimal  probability  distribution 
Pmin(x)  =  Here  <5r,v  is  the  Dirac  distribution  on  T,  i.e.,  the  probability 

distribution  on  T  such  that  Sr.x(y)  =  OVy  6  T\{A'}.  Hence,  (O)^  =  0(Xmln)  =  0,nin 
and  the  minimum  is  attainable.  Due  to  0  <  |  fr  p(x)(0(x)  —  Omin)dx\  < 
fr  I p(x)(0(x)  -  Omin)\dx  =  frp(x)(0(x)  -  Omin)dx  =  (O) p  -  Omin  =  (0)p  -  (0)Pmi7i,  there 
is  no  other  distribution  that  produces  a  lower  expectation  value.  Since  (O)  is  linear  and 
the  space  of  probability  distributions  is  convex,  the  minimum  is  global.  Applying  the 
argument  to  —  O  yields  the  same  for  maxima.  Since  Xmtn  is  a  single  finite  configuration, 
the  associated  nuclear  charge  distribution  is  just 

t(x 

min  ) 

qx„JR)=  V  q,S3(R  —  Rj),  (1) 

3= 1 

where  63  is  the  Dirac  distribution  on  M3.  □ 

Analogously  to  equation  1,  a  nuclear  charge  distribution  can  be  defined  for  any  probability 
distribution  p  on  T.  For  simplicity,  assume  that  Om%n  is  non-degenerate  and,  without  loss 
of  generality  in  the  following,  all  compounds  are  considered  to  be  neutral.  We  next  clarify 
nuclear  charge  distributions. 


3 


Definition  4.  The  nuclear  charge  distribution  q(R)  of  a  probability  distribution  p  on 
T  with  p  and  Pi  is  given  by 

,  <(C(i)) 

<z„(if)  :=  £>(;)  /  P<(x)  Y, 

The  expectation  charge  of  a  distribution  is  q  :=  fR3  q(R)dR.  A  non-negative  distribution  q 
is  called  configurationally  representable  if  and  only  if  there  exists  iGT  such  that 
Q  = 


In  order  to  perform  optimizations  over  these  distributions,  it  is  helpful  to  understand  the 
structure  of  the  distributions  derived  from  probability  distributions  on  T.  The  convex  space 
of  potential  nuclear  charge  distributions  Q  C  J2fi(M3)  is  restricted  to  such  distributions  that 
are  non-negative  on  M3,  whose  coordinate  systems  coincide  with  the  respective  principal 
axes  and  centers  of  mass,  and  for  which  q  >  1.  Since  the  space  of  probability  distributions 
is  convex  and  closed,  so  is  the  space  of  respective  nuclear  charge  distributions  Qr  C  Q. 

The  only  nuclear  charge  distribution  with  q  =  1  is  q(R)  =  S(R),  which  corresponds  to  the 
hydrogen  atom.  Hence,  the  set  of  non-negative  distributions  can  be  convexly  decomposed 
and  the  associated  projection  onto  nuclear  charge  distributions  is  itself  convex. 


With  this  characterization  it  is  possible  to  define  unambiguously  a  functional  on  the  space 
of  non-negative  distributions  for  a  given  bounded  property. 


Definition  5. 


0[q\  ■■  = 


min P:qp=q{(0)p},  3 p  :  qp  =  q 
oo,  otherwise 


This  functional  and  its  properties  are  our  main  objective  and  the  remainder  of  this  section 
is  dedicated  to  characterizing  this  functional.  The  following  lemma  guarantees  that  the 
minima  of  O  are  global. 

Lemma  2.  O  is  convex,  i.e.,  ()[\q\  +  (1  —  Ajqg]  <  XO[q\\  +  (1  —  \)0[q2\. 


Proof  0[\qi  +  (1  -  X)q2]  = 

(^)pi  +  (1  -  (0)P2)  =  ^0[q±\  +  (1 


min  (O)  <  min  (O) 

p:\qi+(l-\)q2=qP  P  Pi  ,P2'-qi=qP1  ,<?2=<?P2 


Api+(1-A)p2 


nun 

Pl,P2-qi=qP1  ,<?2=<?P2 


A  )0[q2 


□ 


Since  Omtn  =  (O)  n  <  { O) p  for  all  probability  distributions  p,  the  global  minima  of  O 
coincide  with  the  minima  of  O,  i.e.,  0[qPmin]  =  Omin  <  0[q ]  for  all  non-negative 
distributions  q.  Thus  it  is  possible  to  arrive  at  the  minimum  of  O  by  varying  q  and  we  have 
established  the  following  important  theorem. 
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Theorem  1.  For  a  bounded  property  O  that  attains  its  optimum,  there  exists  a  convex 
functional  of  non-negative  distributions  on  M3  that  optimizes  to  a  global  optimum  of  O. 

This  result  explains  the  successes  of  LCAP  and  VP-DFT  as  well  as  the  discrete 
optimization  approaches.  It  also  generalizes  the  recent  observation  that  optimization  of 
molecular  properties  is  considerably  more  successful  than  could  be  expected  considering 
the  vastness  of  the  search  space  (20).  Optimization  in  convex  spaces  is  fairly  efficient,  and 
the  reduction  of  the  optimization  space  to  distributions  in  only  three  dimensions  instead  of 
infinitely  many  is  a  considerable  simplification. 

Knowing  the  distribution  without  means  of  interpretation  is  not  particularly  useful.  But  if 
the  global  minimum  of  O  is  non-degenerate,  then  by  lemma  1,  qmin  corresponds  to  a  single 
molecule  or  molecular  system.  This  molecular  system  can  be  uniquely  identified. 

Corollary  1.1.  qopt  can  be  uniquely  mapped  to  a  composition  and  conformation. 

Proof.  Let  qopt(R )  =  YhiQi^R  ~  Ri)->  then  the  configuration  is  the  strictly  descendingly 
ordered  sequence  of  ((qi,Ri))i-  □ 

In  general  applications,  boundary  conditions  of  physical  viability  need  to  be  met.  The 
introduction  of  such  constraints  does  not  violate  the  convexity  arguments  put  forth  earlier, 
since  the  subspace  of  probability  distributions  that  are  zero  whenever  the  constraint  is 
violated  remains  a  convex  set.  Hence,  we  define  the  notion  of  physical  distributions  for 
common  requirements. 

Definition  6.  A  distribution  P  :  7_1(c)  — ■>  M  for  some  composition  c  =  ((g*,  Rf))i  is 
considered  physical  if  =  0  and  nl=i  7o(det  Jf)  >  0  for  all  x  G  7_1(c),  where  E 

denotes  the  energy  and  Jt  is  the  ith  principal  minor  of  the  Hessian  of  E  with  respect  to  the 
coordinates  Ri ,  and  rjo  is  the  Heaviside  step-function  with  value  0  at  0.  A  distribution  on  Y 
is  considered  physical  if  every  Pi  is  physical.  A  distribution  q  on  M3  is  considered 
physical  if  q  =  qp  for  some  physical  distribution  p  on  Y. 

The  energy  as  well  as  its  derivatives  are  properties  that  are  available  over  all  of 
configurational  space.  Hence,  they  can  be  described  as  convex  functionals  of  the  charge 
distribution.  Using  Lagrange  multipliers,  it  is  therefore  possible  to  incorporate  physicality 
constraints  using  these  respective  functionals.  Let  O  :  Y  — ►  R  be  a  property  of  interest  for 
optimization  bounded  from  below  (above).  In  this  case,  the  constrained  functional  becomes 

L  =  (L)p  =(0)p  +  Xg(G)p  +  Xh(J)P,  (2) 

where  G  is  the  gradient  condition  and  J  is  the  condition  of  positive-definiteness  of  the 
Hessian,  with  Lagrange  multipliers  \g  and  A h,  respectively. 
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3.  Approximating  0 


Since  the  exact  functionals  are  in  most  cases  unknown,  it  is  necessary  to  approximate  O.  It 
is  desirable  that  the  exact  functional  can  be  systematically  approximated  with  decreasing 
error.  The  following  lemma  proves  useful  in  characterizing  appropriate  substitutes,  as  it 
obviates  the  need  to  be  exact  on  anything  but  actually  physical  compounds. 

Lemma  3.  If  a  functional  W  of  charge  distributions  dominates  O  from  above  (below)  and 
W [q]  =  min  0(c)  for  all  configurationally  representable  distributions  q,  then  W 

{cer-.qc=q} 

optimizes  to  the  same  nuclear  charge  distribution  as  O. 

Proof.  W[q]  >  0[q]  >  Omin  (domination)  and  W[qmin]  =  0[qmin]  =  Omim  because  qmin  is  a 
configurationally  representable  distribution.  □ 


We  exemplify  the  use  of  lemma  3  in  the  development  of  an  approximate  operator  to 
minimize  a  non-degenerate  electronic  property  O,  e.g.,  the  inverse  of  the  dipole  moment, 
for  neutral  species  in  the  ground  state  using  only  the  nuclear  charge  distribution.  For  any 
configuration  c  E  T,  the  property  0(c)  takes  the  value  (V;  O  where  O  is  the 
corresponding  operator  on  the  space  of  wavefunctions  with  ||c||i  electrons  and 

if  G  WMl  is  the  normalized  ground-state  wavefunction  of  c.  We  define  W  such  that  it 
interpolates  linearly  between  species  with  integer-valued  numbers  of  electrons: 


W[q]  =  pi  (fji  6  i>i ^  +  p2  (i>2  O  +  A gG[q,  ipi,  f)2)  +  A hJ[q,  ipi,  ^2]  : 


E 

=  min  E 

L?J  1 

_  PW  3 

L  (i  J 

{V>36W^j} 

L  (i  J 

E 

—q,w  2 

=  min  E 

M  , 

—q,w  4 

L  d  J 

{V>4ewm} 

L  (i  J 

hi  [q\  +  h2  \q]  =  q, hi  +  h2  =  1 


(3) 

(4) 

(5) 

(6) 


The  three  constraints  under  consideration  in  this  example  are  as  follows: 


1.  The  ground-state  condition  stated  by  equations  4  and  5  and  energy  functional  E 

2.  The  gradient  condition  G 

3.  The  Hessian  positivity  condition  J 


We  delay  the  discussion  of  the  physicality  conditions  G  and  J  to  section  4  and  focus  on  the 
general  properties  of  W. 
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In  order  to  apply  lemma  3,  it  is  necessary  to  establish  that  W  dominates  O  and  coincides 
with  O  on  configurationally  representable  q.  The  ground-state  condition  obeys  the 
conventional  definition: 


1=1  11  11  1<J  J 

Q(X,Y ) 


-£'nuc[0']  — 


\x  -  Y\\ 


dXdY, 


Q(X,Y)  =  -q(X)q(Y)(l-8(Y-X)), 


E[q,jp\  =  (ip 

H 

l - 1 

|Cr<  ley 

pUj  /  (ip\ip)  +  Enuc 

‘M, 

-  <1  . 

E[q,tp]  =  Up 

H 

[M«l 

pUj  /  (0  0)  +  Enuc 

1 - 1 

^  iCr1 

i _ i 

E[q,  0i,  02]  =  fJ'iM.q,  0i]  +  qi2E[q,  02], 


(7) 

(8) 

(9) 

(10) 

(11) 

(12) 


where  fii,fi2  are  as  before.  The  Lagrange  multipliers  enforcing  the  gradient  and  Hessian 
conditions  guarantee  that  unphysical  q  are  mapped  to  infinity  as  they  are  for  O.  Hence, 
only  the  behavior  of  W  on  physical  q  is  of  interest.  If  q  is  physical  and  configurationally 
representable,  H[q\  and  Enuc  are  precisely  the  Hamiltonian  and  nuclear  repulsion  of  the 
corresponding  configuration  c.  Therefore,  W[q\  =  0(c)  for  physical,  configurationally 
representable  q,  fulfilling  the  second  requirement  of  lemma  3. 


It  remains  to  be  shown  that  W  dominates  O  for  physical,  not  configurationally 
representable,  distributions.  The  linear-interpolative  construction  of  W  with  respect  to  q 
reduces  the  discussion  to  arguments  on  integer-valued  q  only.  Without  loss  of  generality,  let 
q  be  an  admixture  of  two  distinguishable  physical,  configurations  x±,x2  G  T  with 
Il7(^i/2)||i  =  Q  and  ground-state  wave  functions  0Xl/2.  Choose  A  G  [0, 1],  then 
q  :=  A qXl  +  (1  -  A )qX2  and  E[qXi,ipj\  >  E[qXi,ipXi\,i  G  {1,2},  j  ^  i,  since  ^  cannot  be  the 
ground  state  simultaneously  for  both  distributions.  Therefore,  the  gradient  condition  is 
violated  and  the  Lagrange  multiplier  pushes  the  value  of  W  to  infinity.  Otherwise, 

W[q]  =  0(x i/2)  >  0[q],  Hence,  the  first  requirement  for  lemma  3  is  fulfilled  as  well.  Since 
Omin  coincides  with  a  configurationally  representable  qmtn,  IT  attains  Omtn  for  qmin,  and 
therefore,  IT  converges  upon  minimization  to  the  same  composition  and  conformation  as 
the  general  ensemble  method  does. 


4.  Enforcing  Physicality 


The  gradient  and  Hessian  conditions  will  be  developed  starting  from  a  configurationally 
representable  distribution.  First,  we  look  at  the  exact  gradient  functional  G  before 
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developing  an  approximate  G.  It  is  easily  discernable  that 


(13) 


where  we  have  used  Egs(x)  to  denote  the  ground-state  energy  and  Vj  is  the  gradient  of  the 
jth  nuclear  position  and  is  non-negative  and  zero  only  if  p  is  physical.  Given  a 
configuration,  the  gradients  are  derived  by  perturbing  infinitesimally  the  charge 
distribution  at  a  specific  point,  i.e.,  S(R  —  Ri)  \\x-r\\  ^  d(R  —  R%  ~  dRi)  \\x-r\\  •  The 
gradient  of  the  electron- nucleus  energy  is  then  taken  with  respect  to  the  direction  dRt. 
Hence,  the  gradient  is  a  small  variation  of  the  nuclear  charge  density.  Therefore, 


d*V  iEgS(x)  =  lim 
y  e— >o 


SEgsjqs r,J  {Y)qMY  -  Rj  +  ed)  -  5(Y  -  Ri)) dy 


r,x 


=dg(Ri)V^f[gJr’a](Ri)  =  (TVg(Ri)^f 

dqsrx 

where  d  is  a  direction  in  M3.  Thus,  we  can  rewrite  equation  13  as 


G[q]  =  min  /  p(x) 
p-qP=q  J 


=  min  /  p(x) 
p-qP=q  J 


d/i(M3)d/i(T) 


dp(M.3)dp(T). 


(14) 

(15) 


(16) 

(17) 


r,x 


Two  simple  approximations  given  wavefunctions  if} i/ip2  derive  then  by  dropping  all 
references  to  T  and  using  the  cumulative  nuclear  charge  distribution, 

1 2 


GAfhfoM  = 
G2[q^iM  = 


gV 


Vg 


dE[q,^  1,^2] 


5q 

fiE[q,i{>  1,-02] 


5q 


dyU(M3), 


d/i(M3), 


(18) 

(19) 


where  we  use  the  linear  interpolative  form  of  the  energy  as  used  in  equation  12.  Since  these 
expressions  are  non-negative  and  exact  for  configurationally  representable  nuclear  charge 
distributions,  G\/ 2  =  0  are  good  constraints.  To  satisfy  lemma  3,  the  question  remains 
whether  G 1  /2  may  be  zero  for  unphysical,  non-conhgurationally  representable  nuclear 
charge  distributions.  We  may  restrict  the  discussion  to  integer-valued  q  only.  Therefore, 
the  necessary  conditions  for  fulfilling  G\  with  the  ground-state  wave-function  0, 


Gi[qAA\  =  0 

q(X)  =  0  or  V8E[?M  =  0  a.e 
dq 


(20) 


may  be  considered  under  constraint  of  constant  q.  We  first  show  that  G\  =  0  is  only  given 
for  countable  sums  of  Dirac  distributions.  Then  we  show  that  these  are  composites  of 


physical  distributions.  If  the  set  N  =  M3  \  {A"  e  M3  :  q(X)  =  0}  has  non-zero  measure,  then 
is  a  constant  on  the  completion  of  N,  which  contradicts  that  E[q,ip,ip\  is  a  quadratic 
functional  of  q.  Therefore,  N  must  be  a  set  of  zero  measure  and,  hence,  q  is  a  series  of 
Dirac  distributions. 

As  shown,  G\  is  related  to  the  norm  of  the  first  variation  of  the  energy  with  respect  to  the 
nuclear  charge  distribution.  If  we  assume  without  loss  of  generality  that  this  distribution 
has  integer  charge,  is  not  configurationally  representable,  and  is  a  superposition  of  two 
distinct  configurationally  representable  distributions,  then  the  energy  functional  as  a 
function  of  the  convex  parameter  pi(n2  =  1  —  qii),  as  defined  before,  is  a  concave 
functional.  It  follows  that  the  gradient  condition  cannot  simultaneously  hold  with  the 
Hessian  condition  for  such  a  distribution. 

The  Hessian  condition  J  can  be  derived  analogously  to  G.  The  Hessian  analogue 

J[q]  =  min  j  p(x)  J  qx(X)qx(Y)S  ^  dXdYdp(V )  (21) 

Ji[q]  =  I  q(X)q(Y)S  dXdY  (22) 

where  S(M)  :=  Yl'i=i  Vo(~  det  Mi),  Adi  is  the  ith  minor  of  a  3  x  3-matrix  Ad,  and  770  is  the 
Heaviside  function,  and  is  positive  semi-definite  for  a  configurationally  representable  energy 
minima.  Extending  finite-dimensional  criteria  to  bilinear  forms  as  in  equation  21  and 
invoking  the  gradient  condition  renders  the  result  that  the  Hessian  matrix  for  the 
configurationally  representable  distribution  q  is  only  non-negative  semi-definite  if  J  =  0. 

We  have  thus  demonstrated  the  equivalence  of  using  G\  and  J\  to  the  gradient  and  Hessian 
conditions  for  use  in  constrained  optimizations  on  physical  distributions. 


5.  Conclusions 

We  have  laid  out  a  general  framework  for  the  optimization  of  properties  of  chemical 
compounds  using  only  nuclear  charge  distributions.  The  unknown  exact  functional  is 
convex,  which  can  be  exploited  for  efficient  optimization  in  discrete  as  well  as  continuous 
approaches.  Furthermore,  the  convexity  remains  even  for  constrained  problems.  In  absence 
of  the  exact  functional,  we  have  derived  an  approximate  functional  for  the  optimization  of 
an  electronic  property  under  the  constraint  of  physical  viability.  Ongoing  efforts  to  develop 
general  purpose  electronic  basis  sets,  e.g.,  wavelets  (22-24)  or  finite  element  methods 
(25-2 7)  open  up  the  potential  to  use  these  same  bases  for  efficient,  systematic  explorations 
of  chemical  space  via  nuclear  charge  distributions. 
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